Learning Optimal Wavelets from Overcomplete Representations
نویسندگان
چکیده
Efficient and robust representation of signals has been the focus of a number of areas of research. Wavelets represent one such representation scheme that enjoys desirable qualities such as time-frequency localization. Once the Mother wavelet has been selected, other wavelets can be generated as translated and dilated versions of the Mother wavelet in the 1D case. In the 2D case tensor product of two 1D wavelets is the most often used transform. Overcomplete representation of wavelets has proved to be of great advantage, both in sparse coding of complex scenes and multimedia data compression. On the other hand overcompleteness raises a number of technical difficulties for robust computation and systematic generalization of constructions beyond their original application domains. In the following, we concentrate on wavelet decomposition of images, and in particular, the computations are carried out for images of faces. With more effort, video files could be handled with adaptation of our present techniques. In this paper we propose a novel and quite general geometric method to generate overcomplete families that are parameterized by wellbehaved manifolds, in fact Lie groups. The main philosophical motivation for our construction is a statistical adaptation of Felix Klein's Erlanger Program. That is, knowledge about the symmetry groups of objects in a family D (called the data set, such as collection of multimedia data or images) carries much of the key statistical geometric features of D and the distribution of such features for members of D in a statistical sense. We use function theory and the geometric properties of the parameter space to extract sparse representations aiming at optimal use of the local geometric features of members of D. The better the statistical distribution of such geometric features, the more efficient and robust becomes the wavelet representation of members of the data set. Briefly, the theory begins with generating the overcomplete family from the local and global transformations afforded by the data sets, e.g. images. In the second stage, we formulate an "action integral" on the suitable function space associated with the moduli space of wavelets. Finally, variational formulas that minimize "the action" provide the sparse representation of data in the form of special points on the moduli space. We illustrate this method by studying the problem of sparse representation of images of faces in a collection D. In the specific application to compute sparse representation of facial images, we develop a specific learning machine to take advantage of the rather rich statistical regularity of frontal view of face images. Rather than starting with a mother wavelet and searching for an optimal representation of a single image, we start with a set of images and ask for a mother wavelet that gives the optimal representation for this class of images. We choose the Support Vector Machine paradigm as the learning paradigm. Then, our problem is one of estimating an optimal kernel in the Reproducing Kernel Hilbert Space that satisfies the admissibility conditions of wavelets. We show simulations results using a database of human faces using a set of wavelet kernels. We discuss extensions of this approach using a local to global technique for eigenfunction expansion.
منابع مشابه
Overcomplete Frame Thresholding for Acoustic Scene Analysis
In this work, we derive a generic overcomplete frame thresholding scheme based on risk minimization. Overcomplete frames being favored for analysis tasks such as classification, regression or anomaly detection, we provide a way to leverage those optimal representations in real-world applications through the use of thresholding. We validate the method on a large scale bird activity detection tas...
متن کاملLearning Nonlinear Overcomplete Representations for Efficient Coding
We derive a learning algorithm for inferring an overcomplete basis by viewing it as probabilistic model of the observed data. Overcomplete bases allow for better approximation of the underlying statistical density. Using a Laplacian prior on the basis coefficients removes redundancy and leads to representations that are sparse and are a nonlinear function of the data. This can be viewed as a ge...
متن کاملWhen are Overcomplete Representations Identifiable? Uniqueness of Tensor Decompositions Under Expansion Constraints
Overcomplete latent representations have been very popular for unsupervised feature learning in recent years. In this paper, we specify which overcomplete models can be identified given observable moments of a certain order. We consider probabilistic admixture or topic models in the overcomplete regime. While general overcomplete admixtures are not identifiable, we establish {\em generic} ident...
متن کاملSoft-competitive learning of sparse codes and its application to image reconstruction
We propose a new algorithm for the design of overcomplete dictionaries for sparse coding, Neural Gas for Dictionary Learning (NGDL), which uses a set of solutions for the sparse coefficients in each update step of the dictionary. In order to obtain such a set of solutions, we additionally propose the bag of pursuits method (BOP) for sparse approximation. Using BOP in order to determine the coef...
متن کاملLearning-Based Approach to Real Time Tracking and Analysis of Faces
This paper describes a trainable system capable of tracking faces and facial features like eyes and nostrils and estimating basic mouth features such as degrees of openness and smile in real time. In developing this system, we have addressed the twin issues of image representation and algorithms for learning. We have used the invariance properties of image representations based on Haar wavelets...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000